
Conversation

mengniwang95
Contributor

Type of Change

example

Description

Add dlrm_v2 CPU FP8 QDQ example
Depends on #2238

FP32: 0.8031
FP8: 0.8080
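For context on what a QDQ (quantize–dequantize) example exercises: values round-trip through the FP8 E4M3 range so the accuracy impact of FP8 storage can be measured while compute stays in higher precision. Below is a minimal, self-contained simulation of one E4M3 round-trip; it is illustrative only, not the implementation in this PR, and `fp8_e4m3_qdq` is a hypothetical helper name:

```python
import math

E4M3_MAX = 448.0  # largest finite value representable in FP8 E4M3

def fp8_e4m3_qdq(x, scale):
    """Simulated FP8 QDQ: scale into the E4M3 range, round to the nearest
    representable value (1 implicit + 3 mantissa bits), then scale back."""
    y = x / scale
    y = max(-E4M3_MAX, min(E4M3_MAX, y))  # saturate to the FP8 range
    if y == 0.0:
        return 0.0
    m, e = math.frexp(y)        # y = m * 2**e with 0.5 <= |m| < 1
    q = round(m * 16) / 16      # keep 4 significand bits (16 steps per octave)
    return q * (2 ** e) * scale
```

Running, e.g., `fp8_e4m3_qdq(0.3, 1.0)` returns `0.3125`, showing the rounding error a QDQ pass introduces; the per-tensor `scale` is what a real flow would calibrate.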

@mengniwang95 mengniwang95 requested review from xin3he and thuang6 July 22, 2025 06:45
@chensuyue chensuyue added this to the 3.5 milestone Jul 22, 2025
return self.crossnet(concat_dense_sparse)


class IPEX_DLRM_DCN(DLRM_DCN):
Contributor

Why name it IPEX_XXX? I don't see any dependency on IPEX; better to rename it.

Contributor Author

done

model = construct_model(args)
model.model.sparse_arch = model.model.sparse_arch.bfloat16()

qconfig = FP8Config(
Contributor

I know Linear is in the default quantization op list. How is EmbeddingBag included in the quantization op list? Did we add it to the default?

Contributor Author

Yes, currently the CPU device supports Conv, Linear, and EmbeddingBag FP8 quantization by default:

PATCHED_MODULE_TABLE["cpu"].update({"Linear": ModuleInfo("linear", PatchedLinear),
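The table quoted above registers, per device, which module types get swapped for FP8-patched wrappers. A dict-driven sketch of how such a registry can drive module patching (all class names here are simplified stand-ins, not the actual neural-compressor internals):

```python
class ModuleInfo:
    """Pairs an op-type label with the wrapper class to swap in."""
    def __init__(self, op_type, patched_cls):
        self.op_type = op_type
        self.patched_cls = patched_cls

class Linear:            # stand-in for torch.nn.Linear
    pass

class PatchedLinear:     # stand-in for an FP8 QDQ wrapper module
    def __init__(self, orig):
        self.orig = orig

# Per-device registry: quantization only touches types listed here.
PATCHED_MODULE_TABLE = {"cpu": {}}
PATCHED_MODULE_TABLE["cpu"].update({
    "Linear": ModuleInfo("linear", PatchedLinear),
    # "Conv2d" and "EmbeddingBag" entries would follow the same pattern
})

def patch(module, device="cpu"):
    """Swap a module for its registered wrapper; leave others untouched."""
    info = PATCHED_MODULE_TABLE[device].get(type(module).__name__)
    return info.patched_cls(module) if info else module
```

This is why EmbeddingBag is quantized "by default" on CPU: being present in the device's table is the opt-in, with no extra user configuration needed.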

@XuehaoSun XuehaoSun merged commit 1ab2011 into master Jul 25, 2025
11 checks passed
@XuehaoSun XuehaoSun deleted the mengni/dlrmv2 branch July 25, 2025 02:08